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Abstract 



o 
o 

Basic path-matchings, introduced by Cunningham and Geelen (FOCS 1996), are a common 
generalization of matroid intersection and non-bipartite matching. The main results of this 
paper are a new algebraic characterization of basic path-matching problems and an algorithm 



of graph theory and combinatorial optimization [24] . Matroid intersection is another fundamental 
problem; its min-max characterization and efficient algorithms have had far-reaching consequences 



for constructing basic path-matchings in 0(71") time, where n is the number of vertices and u> 
is the exponent for matrix multiplication. Our algorithms are randomized, and our approach 
assumes that the given matroids are linear and can be represented over the same field. 

Our main results have interesting consequences for several special cases of path-matching 
■ problems. For matroid intersection, we obtain an algorithm with running time 0(nr w_1 ) = 

\ 0(nr 138 ), where the matroids have n elements and rank r. This improves the long-standing 

O ■ bound of 0(nr 162 ) due to Gabow and Xu (FOCS 1989). Also, we obtain a simple, purely 

algebraic algorithm for non-bipartite matching with running time 0{n u ). This resolves the 
central open problem of Mucha and Sankowski (FOCS 2004). 

> 

^ ! 1 Introduction 
7-h ; 

Non-bipartite matching is a fundamental problem that has played a pivotal role in the development 

O 

in both combinatorics and computer science [31]. Basic path-matchings, due to Cunningham and 
Geelen [11, 12], are an elegant generalization of these two classic problems. 

Cunningham and Geelen showed that maximum-weight basic path-matchings can be found in 
polynomial time via the ellipsoid method [12]. Later, path-matchings were generalized to even- 
factors in a class of digraphs known as "weakly-symmetric" . Cunningham and Geelen [10] devised 
a combinatorial algorithm to compute a maximum even- factor in such graphs. Combining this 
with a matroid (or valuated matroid [26]) intersection algorithm, they obtain an algorithm to 
compute a maximum (or maximum- weight) basic even-factor. Pap [28] considers even-factors in a 
class of digraphs known as "odd-cycle symmetric", and gives a combinatorial algorithm to find a 
maximum even- factor in such graphs. Finally, Takazawa [33] gives a strongly-polynomial algorithm 
to compute maximum- weight even-factors in odd-cycle symmetric graphs. 

Matroid intersection algorithms have a more storied history. Polynomial time algorithms for 
matroid intersection were developed in the 1970s by various authors [13, 14, 22]. The efficiency of 
these early algorithms was typically measured relative to an oracle for testing independence. Cun- 
ningham [9] later developed an efficient algorithm for intersection of linear matroids (those that can 
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be represented as a matrix), which is the broadest class of matroids with efficient representations. 
Cunningham's algorithm requires 0(nr 2 log r) time, where n denotes the size of the ground set 
and r bounds the rank of the matroids. Gabow and Xu [17, 18] obtained an improved bound of 
0(nr 1,62 ) through the use of fast matrix multiplication and quite technical arguments. However, 
their bound does not seem to be a natural one; Gabow and Xu explicitly pose the problem of 
improving their bound as an open question. 

Several variants and generalizations of matroid intersection have been considered, notably ma- 
troid matching. Surprisingly, matroid matching problems on general graphs are not solvable in 
polynomial time (in the oracle model) [24]. On the other hand, bipartite matroid matching prob- 
lems are much more tractable, and have close connections to matroid intersection [16]. It was shown 
by Edmonds [13, Theorem 81] that bipartite matroid matching reduces to matroid intersection, and 
therefore can be solved efficiently. Efficient algorithms for bipartite matroid matching were implicit 
in Aigner-Dowling [1] and explicit in Tomizawa-Iri [34]. 

The aforementioned algorithms for matroid intersection and bipartite matroid matching are all 
based on augmenting paths; indeed, so are most algorithms for these problems [1, 9, 14, 15, 16, 
17, 18, 22, 34]. In contrast, Lovasz [23] introduced a randomized algebraic approach for matroid 
matchings; this approach was employed and extended by Barvinok [3] and Camerini et al. [5]. This 
paper, building on results of Geelen [20], extends the algebraic framework to basic path-matching 
problems. 

Algebraic approaches have recently been used by Mucha and Sankowski to obtain efficient al- 
gorithms for matching problems in bipartite and general graphs [25]. For bipartite graphs, they 
consider the Edmonds matrix and apply a simple but elegant variant of the Hopcroft-Bunch Gaus- 
sian elimination algorithm [4]. For general graphs, their algorithm is much more complicated: it 
maintains the canonical partition of the graph using sophisticated data structures for testing dy- 
namic connectivity. Sankowski [30] also developed an RNC 5 algorithm for constructing perfect 
matchings that uses only 0(n w ) processors, yielding another sequential algorithm that uses only 
0(12^) time. However, this algorithm is based on sophisticated parallel subroutines for evaluating 
a polynomial at a matrix and for computing the characteristic polynomial of a matrix. 

1.1 Our results 

The main results of this paper are a new algebraic characterization of basic path-matching problems 
and a randomized algorithm for constructing basic path-matchings in 0{n u ) time, where n is the 
number of vertices and lo < 2.38 is the exponent for matrix multiplication [7]. Our approach 
involves two assumptions. First, as is common, we assume that the matroids associated with the 
basic path-matching problem are linear. Additionally, we make the mild technical assumption that 
any given pair of matroids are represented as matrices over the same field. Although there exist 
matroids for which this assumption does not hold (e.g., the Fano and non-Fano matroids), this 
assumption can be satisfied for the vast majority of matroids arising in applications. The regular 
matroids are those that are representable over all fields; this class includes the graphic, cographic 
and partition matroids. Many classes of matroids are representable over all but finitely many fields; 
these include the uniform, matching, and transversal matroids, as well as deltoids and gammoids 
[31]. Our results apply to any matroids from these classes. 

As a consequence, we obtain O^) randomized algorithms for constructing perfect path- 
matchings, maximum matchings in general graphs, maximum bipartite matroid matchings, and 
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maximum intersections of two matroids. Here n denotes the number of vertices in the graphs or, 
for matroid problems, the cardinality of the ground set. Our algorithms are purely algebraic and 
do not require use of sophisticated data structures or subroutines other than matrix multiplica- 
tion. Therefore our non-bipartite matching algorithm resolves the central open question from the 
paper of Mucha and Sankowski [25]. Their structural approach using the canonical partition is not 
necessary; a simple, but subtle, divide-and-conquer approach suffices. 

For matroid problems, we can obtain tighter bounds by also considering the rank r of the given 
matroids. For bipartite matroid matching, the bound improves only slightly: the time required is 
0(r?r w ~ 2 + r w+1 ), which is tighter than 0(n w ) when r = o(n w /^ +1 )). For matroid intersection 
problems, the improvement is considerable: we present an algorithm that requires only 0(nr w_1 ) 
time. This is essentially the best that one could hope for since computing the rank of a n x r matrix 
reduces straightforwardly to linear matroid intersection, and 0(nr w_1 ) is the best known bound 
for computing the rank. Interestingly, our matroid intersection algorithm resembles a well-known 
proof of the matroid intersection theorem, for which it is remarked that the "proof gives no hint 
of how to find the [matroid intersection] efficiently" [6, p289]. Now suppose that we consider only 
algorithms using naive matrix multiplication. Under this restriction, the best known algorithm 
for linear matroid intersection is Cunningham's, which uses 0(nr 2 log r) time. The Gabow-Xu 
algorithm, despite its much more sophisticated techniques, also achieves the same bound. Our 
algorithm requires only 0(nr 2 ) time, under the assumption that the field over which the matroids 
are represented is sufficiently large. 

1.2 Notation and Basic Facts 

The set of integers {1 , . . . , m} is denoted [m] . If J is a set , J + i denotes J U {i} . The z th elementary 
vector is denoted e,, that is, ej is 1 in the i th component and zero elsewhere. If M is a matrix, 
a submatrix containing rows S and columns T is denoted M[S,T]. A submatrix containing all 
rows (columns) is denoted M[*,T] (M[S, *]). A submatrix M[S,T] is sometimes written as M$,t 
when this enhances legibility. The i th row (column) of M is denoted M^* (M*^). An entry of M 
is denoted Mjj. A submatrix containing rows {a,...,b} and columns {c,...,d} is denoted M a: b jC: d. 
The submatrix obtained by deleting row i and column j (row-set I and column-set J) from M is 
denoted M^^^ (M^ e uj n). If u and v are vectors and c is a scalar, the matrix M = M + cuv T 
is called a rank-1 update of M. Assume that M is non-singular and let a = c" 1 + v T M~ 1 u. The 
inverse of M exists iff a / 0, and equals 

M- 1 = M- 1 - a- 1 (M- 1 u) (v T M- 1 ), (1) 

which is itself a rank-1 update of M -1 . The following expansion of the determinant is useful. 

Fact 1 (Generalized Laplace expansion). Let M be an n x n matrix. Fix a set of rows / such that 
0^7 C {l,...,n}. Then 

detM = Yj detM[I,J] -detM[I, J] • (-i)E ie i *+E je j i_ ( 2 ) 

JC{1,. ..,n}, | J\=\I\ 

An analogous statement holds for a set of rows / by taking the transpose of A. 

Proof. See Aitken [2] or Murota [27]. ■ 
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A matroid is a tuple M = (V,I,B,r) where V is the ground set, X C 2 V is the collection of 
independent sets, B C X is the collection of bases, and r is the rank function. The rank of the 
matroid M is r(V). Together with V, any one of X, B, and r is sufficient to specify the matroid, 
so we do not necessarily mention all of them. To emphasize connection to a specific matroid, we 
sometimes use the notation Xm, j6m an d nvi- For J C V, M/J denotes the matroid obtained from 
M by contracting J. Its rank function is r^/j(S) := r-\i(J[JS) — r^/i(J). For J C V, M\ J denotes 
the deletion of J from M (or restriction of M to V \J). The direct sum of two matroids Mi and 
M2 on disjoint ground sets is denoted Mi © M^. The free matroid on a set V is F(V) := (V,jB) 
where B = {V}. 

2 Path-Matchings 

2.1 Definitions 

An instance of the a basic path- matching is a tuple G = (Xi, T2, S 1 , X 1 , Mi, M2) where (XiUX^US*, X) 
is a graph and each Mj is a matroid (Xj,Xj,rj). The vertex sets Xi, X2 and 5 are disjoint and 
furthermore Xi and X2 are stable sets (no edge has both endpoints in either Xi or X2). Let s = \S\ 
and assume that |Xi| = | T2 1 = t, and that Mi and M2 have the same rank r. 

A path-matching is a collection of node-disjoint paths with one endpoint in Xi and the other 
in X2, together with a matching on the S- vertices not contained in any of the paths. If M C E 
is a path-matching, let d^M C Xj denote the set of vertices in Xj that are covered by M, and let 
dsM C S denote the covered 5-vertices. A perfect path-matching is a path-matching M such that 
diM = T{ and dsM = S. A basic path-matching (bpm) is a path-matching such that diM G B^ 
and that d s M = S. 

2.2 Contracted Instances 

Let G = (Xl, X2, S, E, Mi, M2) be an instance of the basic path-matching problem. A set M C E 
is called an extensible set for G if there exists a bpm M' D M. Let M be an extensible set and 
note that diM G Xm,- We will now define a basic path-matching problem G(M), which we call the 
contraction of G by M. 

Define diM = Xj \ <9jM, and note that 3iM and d^M are not necessarily equicardinal. Let Pj 
be the set of paths in M with one endpoint in Xj and the other in S. The set Cj C S 1 consists 
of the endpoints in 5 of the paths in p. Define X/ := <9~M U d and S' := S\ d s M. Define 
X' C X to be the set of edges with both endpoints in S' or with one endpoint in T[ and the other 
in S' U Xj where i ^ j. The matroid M^ is (Mi /diM) © F(Cj). The contraction of G by M is 
G(M) := (X^X^^X'^'^M^). 

We now observe certain properties of G(M). First, C{ n S" = since Cj C dsM . Second, the 
ground set of M^ is T[ and furthermore r M /(X/) = r. Now let p be the number of paths in M 
connecting Xi to X 2 . Then |X/| = |X| - p since |c?iM| = \Ci \ + p. Next, if M' D M is a bpm for 
G then M" := M' \ M is a bpm for G(M). Conversely, if M" is a bpm of G{M) then M U M" is 
a bpm of G. This follows because every basis of M^ clearly contains Cj, and hence every bpm of 
G(M) covers Q. 
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2.3 Algebraic Structure 



Suppose that each Mj is a linear matroid representable over a common field F. Let Qi be an r x t 
matrix whose columns represent Mi over F and let Q2 be a t x r matrix whose rows represent M2 
over F. For notational convenience, we will let Qf denote Qi[*, J] and Q'2 denote Q2[J, *]■ We now 
define a matrix of indeterminates X which describes the graph underlying G. The rows of X are 
indexed by T\ U S and the columns are indexed by T2 U S. The entries are defined as Xij = ±x^jy, 
where the signs are chosen such that X[S, S) is skew-symmetric. 

Lemma 2 (Geelen [20]). X[S, S] is non-singular iff G has a perfect path- matching. 

We now extend this algebraic characterization to basic path-matching problems. Let Dj be a 
diagonal matrix of size \d{M\ whose entries contain distinct indeterminates. Define Z(M) to be 
the following matrix: 



Z(M) 



Q d 2 M 



\ 



and for convenience let 



Qf lM Q° 



diM 



Do 



Dx X[d 1 M,d 2 M] X[d 1 M,C 2 US'] 

X[CiUS',d2~M] X[CiUS',C2US']J 



Do 



X'=\D 1 X[diM, d 2 M] X[diM, C 2 U S'} 
X[dU S',d2~M] X[CiUS , ,C 2 US r \ 



Theorem 3. Let M be a set of edges in G such that d\M G J Ml and d 2 M G 2"m 2 - Then G(M) 
has a bpm iff Z(M) is non-singular. 

Proof. We apply the generalized Laplace expansion (Fact 1) to Z{M) with the set of rows 
/ = {1, r}, obtaining 

detZ(M) = ±detZ(M)[I,A] ■ det Z(M)[I, A] (3) 

Ac[k],\A\=r 

Clearly A must be a subset of the columns of Q\ and also A D d±M, otherwise either Z(M)[I, A] 
or Z(M)[I,A] will contain a zero column. So Eq. (3) may be rewritten 

detZ(M) = Yl ±detQi[*,A(J d±M] ■ det Z(M)[l,A~Ud^M]. 

ACdiM 
\A\=r-\diM\ 

We now apply the Laplace expansion to Z(M)[I, A U d±M] with the set of columns /' = {1, r}. 
A similar argument yields 

detZ(M) = Y ±detQ 1 [*,AUd 1 M] ■ Yl ± det Q 2 [B U d 2 M, *] • det X'[B, A] (4) 

ACthM BCd^M 
\A\=r-\d!M\ \B\=r-\d 2 M\ 
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Consider now the matrix X'[B, A]. We may use the remaining entries of D\ to clear their rows, 
and similarly for D 2 . The resulting matrix has the form 

\ 

X[A,B] X[A,C 2 US'} 
X[dU S',B] X[C 1 US',C 2 US']J 

where D[ = (-Di)dei(/l,A) and D 2 = (D 2 ) de ^ BB y To simplify our notation, let Y(A, B) denote 
J[AuCiU5', BUC 2 US'}. Clearly det X'[B,~A] = idetZ^ • det D' 2 ■ detY(A, B). 

The factor det D[ - det D 2 is a product of distinct indeterminates. The crucial observation is that 
this these products are distinct for each choice of A and B. Therefore there can be no cancellation 
among the terms of Eq. (4), so it follows that Z(M) is non-singular iff there exist A C d\M and 
B C d 2 M such that Qi [*, AUdiM], Q 2 [BUd 2 M, *], and Y(A, B) are all non-singular. Equivalently, 
we must have AliCi £ £>m' i5 BuC 2 <G £>m' 2 , and Y(A, B) non-singular. By Lemma 2, this is precisely 
the condition that G(M) has a bpm. ■ 

By the discussion of Section 2.2, an equivalent statement of Theorem 3 is as follows. 
Corollary 4. M is an extensible set iff Z{M) is non-singular. 

2.4 Allowed Edges 

Suppose that M is an extensible set. We say that an edge e = {i,j} in G(M) is allowed (relative 
to M) if M + e is also extensible. This section explains how to test efficiently whether e is allowed. 
It will be convenient to identify the vertices T[ U S' (vertices T 2 U S') with rows (columns) of the 
submatrix of X in Z(M). Also, let N = Z(M)~ l . There are two cases. 

(A) : Either i or j is in T{UT 2 , say i G T[ and j € T^US'. Then Z(M + e) = Z(M) dcl(iJ) , modulo 
a permutation of the rows and columns. Therefore, e is allowed iff Nj i ^ 0, since det Z{M + e) = 
± det Z(M) del (jj) = ± det Z(M)-Nj^. Furthermore, the matrix Z(M+e)^ 1 may easily be computed 
from N. It is known (e.g., [25, Theorem 3.1]) that 

ZiM + e)' 1 = (N - N^-iN^-N^) (5) 

This is a rank-1 update and in fact Eq. (5) can easily be derived from Eq. (1). 

(B) : Both i and j are in S'. Then Z(M + e) = Z(M)^ e uu j\ s i j\y Jacobi's theorem (see, e.g., 
Gantmacher [19, §1.4]) implies that det Z(M) de unj\uj\\ = ±det N[{i, j} , {i, j}}. Therefore, e is 
allowed iff det N[{i, j} , {i, j}] ^ 0. As before, the matrix Z(M + e) _1 may easily be computed 
from N. A simple extension of Eq. (5) shows that 

Z ( M + ^ = - N *^} ■ (%-M)" 1 • ^>) dd({w}>{w}) - (6) 

3 Basic Path-Matching Algorithm 

In this section we develop an efficient algorithm for the basic path-matching problem. Let G = 
(Ti, T 2 , S, E, Mi, M2) be an instance, and let n = |Ti| + \T 2 \ + \S\. We use the following general 



X" 



D' 2 
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approach. We start with the matrix Z and randomly substitute values for the indeterminates 
from the field F, or a sufficiently large extension. Operating in an extension field increases the 
running time by a factor of 0(logn • poly (log log n)). By standard arguments, which we omit, this 
random substitution does not affect the rank of Z with high probability. If Z is singular then 
Theorem 3 implies that no basic path-matching exists (whp). So assume that Z is non-singular. 
The algorithm computes Z -1 , which requires only 0(n u ) time. Let ./V be the transpose of the 
lower-right submatrix of Z^ 1 (corresponding to the matrix X). 

The algorithm searches for allowed edges, following the approach of Section 2.4. Once an allowed 
edge {i, j} is found, we set M = { {i,j} } and construct a bpm for the contracted instance G(M). 
This requires constructing the corresponding matrix Z(M) _1 , which can be done via the updates 
in Eq. (5) and Eq. (6). Performing these updates immediately takes 0(n 2 ) time and would lead to 
an 0(n 3 ) algorithm. Instead, we will store the parameters of the updates that are to be performed 
and only apply the updates to portions of N that will be examined soon. An update that has yet 
to be applied is called pending. A submatrix for which no updates are pending is called clean. 

Each update involves subtracting from Z(M)~~ 1 the product of three matrices u, c and v where 
u is n x k, c is k x k, v is k x n, and k £ {1,2}. Furthermore, u, c and v are submatrices of 
Z(M)^ 1 (at the time of the update), and the submatrices involved in different updates are disjoint. 
Therefore we may represent all updates by three n x n matrices U, C and V, and two lists tt c and 
ir r . U stores the n-matrices, C stores the c-matrices, and V stores the -u-matrices; they are all 
initially zero. The list ir c specifies which columns of U and C are involved in each update, and 7r r 
similarly describes the rows of V and C. 

The algorithm proceeds recursively, starting from the graph G with vertex sets T\, T2 and S. For 
convenience, we imagine augmenting the submatrix X so that its row-indices and column-indices 
are identical: we add empty columns corresponding to T\ and empty rows corresponding to T2. 
Next, we arbitrarily partition T1UT2US into a > 3 equal-sized parts Vi,...,V a , where the value of a 
will be chosen later. For each unordered pair of parts {V a , Vt}, we recurse on the subproblem with 
vertex set V a U V\>. A leaf of the recursion considers a 2 x 2 submatrix N[{i, j} , {i, j}]; furthermore, 
every such submatrix is considered by at least one leaf. 

The algorithm maintains the invariant that the submatrix N[V a U V5, V a U V&] is clean whenever 
it starts or finishes solving the subproblem for vertex set I4UVfe. Consider the a leaf of the recursion 
tree for a pair of vertices {i,j}. As in Section 2.4 there are two cases. (A): i G T\. The edge {i, j} 
is allowed iff Nij 7^ 0. The case where j <G T2 is analogous. (B): Both i and j belong to S. The 
edge {i, j} is allowed iff det N[{i,j} , {i,j}] / 0. 

If {i,j} is allowed, we perform the following steps. First, we adjoin {i,j} to M. Next, the 
parameters of the update (either Eq. (5) or Eq. (6), depending on whether case (A) or (B) applies) 
are stored in the matrices U, C and V; however, matrices U and V are updated only conceptually. 
To satisfy our invariant we need only ensure that the submatrices of U, V and iV corresponding 
to the current subproblem are updated. The lists ir c and 7r r are also updated to indicate which 
columns and rows are involved in this update. 

Suppose that the subproblem with vertices V a L)Vb has just been processed, and now we continue 
processing its "parent" in the recursion tree, which we denote by G. Any update generated during 
processing of the subproblem is pending. Suppose that iV has the block form = ( I ^ f NE ) 



where, without loss of generality, the vertices V a U V5 correspond to the "north-west" submatrix. 
Assume that U, C and V have the same block form. The submatrix Njyw is clean by our invariant 
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but the other three submatrices of N are not, nor are Usw an d Vne- To update Usw-> let U = N$Wi 
C = CnWi an d let V be a copy of Vmw where rows that do not correspond to pending updates 
are set to zero. Assume that these matrices are ordered according to ir c and ir r . Observe that V is 
strictly upper triangular, since columns participating in an update are set to zero. 

To compute Usw-, we will employ two tricks. The matrix C is a block-diagonal matrix with 
lxl and 2x2 blocks. The 2x2 blocks correspond to updates of type (B), which are rank-2 
updates. Such an update can be unfurled into four rank-1 updates as follows: 



Cl,l Ci )2 
C2,l C 2) 2 



v 2 



/ci,i 



U\ U2 U2 Ui 



C2,2 



V 



c l,2 



C2,ly 



— Ul — 

— «2 " 

— Ul — 

\ — v 2 — , 



(7) 



We now wish to apply all of these updates. The difficulty is that the columns of U involved in the 
j th update may be modified by the i th update if i < j. To surmount this difficulty, we use the 
following lemma. 

Lemma 5 (Sequential Update Lemma). Let X and Y be n x n matrices where Y is strictly 
upper triangular. Define a sequence of matrices by X^ = X and X® = X^ + x[\ 1] ■ Y it * for 
1 < i < n. Let X <g> Y denote the matrix X^. Then X ®Y = X ■ (I - Y)' 1 . 

(i) (k) 

Proof. First, note that X^: = X^ / if i < j < k since Y is strictly upper triangular. Define 



A 



I-Y 
X I 



and consider performing Gaussian elimination on A. Let denote the south-west submatrix of A 
just before the i th elimination. An easy inductive argument shows that sj?\. n = X^\. n . The (lower 

half of the) column vector involved in the i th elimination is therefore sjf\ = X^J. Now consider 
the LU-decomposition of A: 



I-Y 
X I 



I 
B I 



I-Y 
/ 



It is well-known that -B*^ is precisely the (lower half of the) column involved in the i th elimination 
(see, e.g., Strang [32]). Thus B = X^ = X <g> Y. The lemma follows by observing that X = 
B-(I-Y). U 

We can therefore compute Nsw = U ® ( — C ■ V) in 0(n w ) time by the sequential update lemma. 
The columns from Nsw corresponding to pending updates are then copied into Usw and set to 
zero. This makes Nsw an d Usw clean. A symmetric argument explains how to make Vne and Nne 
clean. Finally, let U, C and V denote the submatrices of Usw, Cnw an d Vne corresponding to 
the pending updates. We make Nse clean by setting Nse = Nse — UCV (cf. Eq. (5) and Eq. (6)). 
All matrices (within the current subproblem) are now clean, and the recursion may continue. 

Starting from an instance with n vertices, this algorithm recurses on (2) subproblems with ^ 
vertices. After solving each subproblem, the algorithm ensures that the matrices of the parent 
subproblem are clean. So the time required satisfies the recurrence 



Tto = ®-r(^)+0((?)-n w ). 



(8) 
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The solution of the recurrence is T(n) = 0{n^) if a is a constant chosen such that log a / 2 (2) < oj. 
Since log Q / 2 (2) < 2 + i og 1 a _ 1 , there exists an appropriate choice of a, assuming that oj > 2. If 
oj = 2, this analysis gives an 0(n 2+e ) bound, for any e > 0. We may obtain an improved bound by 
modifying the algorithm to keep a bit vector to ensure that each recursive subproblem is entered 
at most once. Thus at level i of the recursion there will be ( a(a / 2)< ) < 2(a/2) 2i+2 subproblems, 
and the time for applying updates is 0((n(a/2)~ t+1 ) u '). Since there are at most log a / 2 n levels of 
recursion, this yields a bound of 0{n w logn) which is valid even if oj = 2. 

4 Applications 

In this section we discuss several applications of basic path-matchings and our preceding results. 

4.1 Non-bipartite Matching 

The problem of finding a perfect matching in a general graph G is the special case of the basic 
path-matching problem where Ti = T 2 = 0. Let n = \S\ be the number of vertices in the graph. 
In this case, an extensible set M is a matching that is contained in a perfect matching. The 
matrix Z{M) is simply the Tutte matrix restricted to the vertices exposed by M, and is therefore 
skew-symmetric. This fact has useful implications. 

First of all, we may wish to find a maximum matching instead of a perfect matching. To 
do so, we first choose a random substitution for the indeterminates in Z, then find a full-rank 
submatrix, say Z[A, B], by the Hopcroft-Bunch [4] algorithm. Lovasz [23] showed that |^4| is twice 
the cardinality of a maximum matching. Furthermore, it is known that Z[A,A] also has full rank 
(by Frobenius' theorem [29] or the Grassman-Pliicker identity [27, p434]). This implies that G[A] 
has a perfect matching, so we may apply the algorithm of Section 3 to G[A\. 

Since Z(M) is skew-symmetric, the matrix N = Z(M)~ l is as well [8, p521]. Therefore, every 
2x2 submatrix inspected by our algorithm is of the form ( _° x § ) . Consequently, Eq. (6) simplifies to 
the sum of two rank-1 updates, and the transformation of Eq. (7) becomes unnecessary. Moreover, 
this submatrix has non-zero determinant iff x ^ 0. This simplifies the test of Section 2.4 (B) and 
also yields an alternative proof of [29, Lemma 3]. 

4.2 Bipartite Matroid Matchings 

An instance of the bipartite matroid matching problem is a tuple G = (T±, T 2 , E, Mi, M2) where 
E C T\ x T2 and Mj = (Tj,Xj,rj) is a matroid. An independent matching in G is a matching 
MCE such that d\M € Z\ and c^M <G T^. The objective is to find a maximum cardinality 
independent matching. The bipartite matroid matching problem is clearly a special case of the 
basic perfect-matching problem where S = 0. Theorem 3 extends to show that the maximum 
cardinality of an independent matching in G(M) is rankZ(M) — 2n. (Murota's characterization of 
the rank of mixed matrices [27] is convenient for proving this extension.) 

Additionally, the algorithm of Section 3 immediately solves the bipartite matroid matching 
problem in 0{n u ) time. When applied to bipartite matroid matching problems, this algorithm 
amounts to executing the Mucha-Sankowski bipartite matching algorithm on the lower-right sub- 
matrix of Z^ 1 . Thus we have the following surprising result: when the Mucha-Sankowski algorithm 
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executes on X and X~ , it computes a maximum bipartite matching. However, when it executes on 
X and Q = Q2(QiXQ2)~ 1 Qi (which is the lower-right submatrix of Z^ 1 ) it computes a maximum 
independent matching. 

A somewhat more efficient bound may be obtained by considering the rank r. The algorithm 
begins by considering the initial graph G and Q. We note that Q may be computed in 0(n 2 r w ~ 2 ) 
time. The vertex sets V\ and V2 are each partitioned into two parts. The algorithm recurses on all 
four ordered pairs of parts, and the corresponding submatrices of Q. Since since any independent 
matching has size at most r, the number of subproblems per level that generate an update is at 
most r. Furthermore, the size of the updates computed may also be parameterized by r. These 
observations lead to the bound 0(n 2 r w_2 + r w+1 ) which is tighter than O(n^) when r = o(n w /( a ' +1 )). 
The details are mundane and hence omitted. 



4.3 Matroid Intersection 

A matroid intersection problem is a bipartite matroid matching problem where the edges E in the 
bipartite graph are a matching. Alternatively, if Mi = {V,Z\) and M2 = (V,^) are two matroids, 
a set J C V is called an intersection if J G X\ nl2- A maximum intersection is one with maximum 
cardinality. If J C V is contained in a maximum cardinality intersection, it is called extensible. If 
J is extensible, i J, and J + i is also extensible then element i is called allowed. 

For matroid intersection problems, the matrix Z(M) can be greatly simplified, owing to the fact 
that all principal submatrices of X are non-singular. If J is an extensible intersection, we define 
the matrix Z(J) as 

/ q( Qi \ 

Z{J)=\Qi 

\Q 2 ^dci(j,j)/ 

By an argument similar to Theorem 3, it follows that the cardinality of a maximum intersection in 
Mi /J and M2/J is rankZ(J) + | J\ — \V\. This result was mentioned by Geelen [21] (and attributed 
to Murota [27]) for the special case that J = 0. 

The characterization of allowed elements for matroid intersection problems is as follows. We 
identify the elements of V with the rows and columns of the submatrix of X in Z(M). 

Lemma 6. Let J be an extensible intersection and let i G V \ J. The element i is allowed if and 
only if (Z(J)-\ t ^zT\ 

Proof. As mentioned above, J + i is extensible if and only if Z(J + i) is non-singular. By 
linearity of the determinant, det^(J + i) = detZ(J) — Zi ■ det Z(</)dei(i,i)- Since (Z(J)~ 1 )i : i = 
detZ(J) dcl(M) /detZ(J), we have det Z(J+i) = det Z{J) ■ (1 -Zi ■ (Z(J) _1 ) M ). Thus det Z{J+i) = 
^ Z(J)-J = z~\ U 

The structure of Z(J) will play a helpful role in our algorithms for matroid intersection below. 
The Schur complement of X de uj n is 

Y{J) . = ^-Qi-x^r i -Qi Qiy 
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For simplicity, let Y denote Y(®). One may verify that 

z(J)- 1 = 



Y(J)- 1 -(Y(jr% [r] .Q{.X(J)^ \ 

X(J)- 1 ■ Q J 2 ■ (Y(J)-% U . X(J)- 1 + X(J)- 1 • Q J 2 ■ (Y(J)-% }j [r] ■ Q( ■ X(J)- 1 ) 

(9) 



The matroid intersection algorithms developed below will use this structure of Z(J) 1 crucially. 
4.3.1 Matroid intersection algorithm I 

In this section we describe our first matroid intersection algorithm, which achieves running time 
0(nr aJ ~ 1 logr). We start with the matrix Z and randomly substitute values for the indeterminates 
from the field F, or a sufficiently large extension. Next, we restrict our attention to a full-rank 
principal submatrix of the form Z[A,A] with {r + 1, r + n} C A. Such a submatrix may be 
found in 0{mn u} ~ 1 ) time by the following scheme. First compute Y, which requires 0(nr u '^ 1 ) 
time. Now, since Y is the Schur complement of X, it follows that rankZ = ranky + n. Thus the 
submatrices of Y of rank k correspond to submatrices of Z with the desired form and rank k + n. 
A maximum rank submatrix of Y can be found in 0(r w ) time by performing Gaussian elimination. 

The present algorithm maintains an extensible matching, initially empty, and searches for al- 
lowed edges using Lemma 6. This seemingly requires computing the entire matrix Z -1 , which 
would require a prohibitive Q(n 2 ) time. Initially we work under the assumption that Z _1 has been 
completely computed, and later we will show that this assumption can be removed. We also assume 
that both n and r are powers of two. 

Suppose that an allowed element % has been found and we now wish to construct 
The matrix Z({i}) is identical to Z except that z\ has been set to 0. This can be expressed as the 
rank-1 update Z({i}) = Z + (— Zj)e r+ jeJ +i . The corresponding change to the inverse matrix is 

Z{{i})- 1 = Z- 1 - (z- 1 + (Z-V^r 1 (Z'^r+i (Z'^r+i,*. (10) 

Because the updates have this form, we can again adopt the approach of Section 3. To harmonize 
the notation, let N denote the lower-right submatrix (Z _1 ) r +i:r+n,r+i:r+n- As before, updates will 
be represented using matrices U, C and V. 

The algorithm is recursive, initially considering the entire vertex set V and the entire submatrix 
N. The vertex set is partitioned: let V\ be the first n/2 vertices and V2 the remainder. The 
algorithm recurses first on the subproblem V±, and then on Vi- However, the invariant used in 
Section 3 does not lead to an efficient algorithm for matroid intersection. We obtain an efficient 
algorithm through the following more relaxed invariants: 

(1) : When the recursion enters a subproblem with vertex set A then N[A, A] is clean if \A\ < r. 

If \A\ > r then the r x r blocks on the diagonal of N[A, A] are clean. 

(2) : For any subproblem, before its second child subproblem begins processing, then the Usw 

and Vne submatrices are clean. 

A leaf of the recursion tree considers only a single vertex, and simply checks if iVj j 7^ z~ x . If so, 
the element i is added to the intersection, and we conceptually perform the update of Eq. (10). To 
represent the update, we store its scalar parameter in C^j. 

Consider a subproblem at level % of the recursion tree. The number of vertices in this subproblem 
is n1~ % . Once its first child has finished, we must update Usw and Vne- This step is more involved 
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than in Section 3 because of our relaxed invariants. We say that an update is old if it was produced 
before entering subproblem V\. The difficulty is that old updates might not have been applied to 
Nsw- The easy case is when n2~ l < r. In this case, invariant (1) implies that the old updates have 
been applied. It remains to apply the pending updates that were generated within the first child 
subproblem. This is done as in Section 3, using only 0((n2^ t ) UJ ) time. 

The more difficult case is when n2~ % > r. In this case, there may be old updates which have 
not yet been applied to Nsw- I n fact, no updates have been applied to Nsw whatsoever. However, 
the portions of U and V which contain the updates relevant to Nsw are clean; this follows from 
applying invariant (2) to the parent of the current subproblem. The only portion of Nsw that is of 
interest is the submatrix corresponding to columns where updates were made. Let us denote this 
submatrix by N. Its size is at most n2~ % x r. The number of old updates which must be applied 
to N is also at most r. Therefore the time required to apply the old updates is bounded by the 
time to multiply an n2~* x r matrix by an r x r matrix, which is 0(n2~ t r M ~ 1 ) time. The updates 
generated within the first child subproblem can also be applied to Nsw in 0{n2~ l r M ~ 1 ) time by the 
sequential update lemma, as in Section 3. A similar argument applies to updating N^e and Vne- 
Finally, we must update Nse- To restore the invariants, we only need update its r x r diagonal 
blocks. This requires time only 0(n2~ l r M ~ 1 ) by the obvious approach. 

To analyze the time required by this algorithm, recall that there are 2 l subproblems at level i. 
For levels i < log(n/r), the total time required is X^i™^ 2 l ■ 0{n2~ t r u) ~ 1 ) = 0(nr w ~ 1 logn). For 
levels i > log (n/r), the time is 

log n log n log r 

2 i ■0{{n2- i ) u) )= /Z 0(n w 2-^-^) = ^20(n2^-^) =0(nr w - 1 ), 

i=log(n/r) i=log(n/r) i=0 

so the total time required is (^(nr^ -1 ). 

The preceding discussion assumes that N was initially fully computed. However, it is clear 
that the only parts of N that are needed are those in the r x r diagonal blocks and those that 
are involved in updates. It is straightforward to extend the algorithm so that the necessary parts 
of N are computed on demand. At the beginning of the algorithm, we compute only the r x r 
diagonal blocks of N. As shown by Eq. (9), this requires multiplying QjY~ 1 Qi plus some negligible 
additional work, and hence 0(nr UJ ~ 1 ) time is required. Next, consider performing an update where 
n2~ l > r. Then we must compute the initial submatrix iV (before any updates are applied). Recall 
that N has size at most n2~ % x r. Then, refering again to Eq. (9), we see that N can be computed 
in 0(n2 - V w-1 ) time. Thus the preceding analysis applies without change. 

4.3.2 Matroid intersection algorithm II 

We conclude the paper with a simpler matroid intersection algorithm that is efficient if n is suf- 
ficiently large, or if we restrict ourselves to naive matrix multiplication. As usual, the algorithm 
chooses random values for the indeterminates in Z from a sufficiently large field extension, and 
restricts attention to a full-rank submatrix. The algorithm relies on the fact that the diagonal 
elements of Z' 1 can be computed quickly, and can also be updated as elements are added to the 
extensible intersection. For convenience, let y(J) denote (Y( J) _1 )[ r ] ) [ r j. As is evident from Eq. (9), 
computing the diagonal elements of Z{ J) -1 involves computing the matrix y. The following lemma 
shows how adding an element to the intersection changes y. It is obvious that the change is a rank-1 
update, but less obvious that the parameters of the update exclusively involve y, Qi and Qi- 
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Lemma 7. Let J be an intersection and let i J be such that J + i is extensible. Let u = 
and let « = (Qj)*,.. Then }>( J + i) = y{J) - (v T y(J) u)~ l ■ Q>(J) u) ■ (v T y(J)). 

Proof. Note that Z(J + i) = Z(J) + (-Zi)e r+i eJ +i . Hence 

Z(J + i)- 1 = Z(J)- 1 - a' 1 ■ (Z(J)' 1 e r+1 ) ■ (ej +l Z( J)" 1 ), (11) 

where a = {—z~ l + (Z( J)~ 1 ) r +i,r+i)- Using the structure of Z(J)~ 1 in Eq. (9), we see that 
(Z( J)~ l ) r+ i )r+ i = z~ l {\ + z~ 1 v T yuz^ 1 ), so a = z~ 2 v T yu. Now, restricting Eq. (11) to the first r 
rows and columns, we obtain 

y(J + l) = ^(J)-a-(^(J)- 1 )l:r,r+ t -(^(J)" 1 )r+ l ,l : r. 

Again using the structure of Z( J) -1 , we see that {Z{J)~ l )\-_ r ^ r+ i is the (i— j) th column of — (y -1 )i :T -,i:r" 
• ^(J) _1 , which is -z^ 1 } 1 !!. Similarly, (Z( J)~ 1 ) r +i,i-.r = —z~ l vy, establishing the lemma. ■ 

The algorithm begin with J = and computes the initial matrix y(J) = Y^ 1 , which requires 
0(nr w_1 ) time, as observed above. Using y(J), we may efficiently compute the diagonal elements of 
Z(J)~ l in rxr blocks. Each block requires 0{r w ) time. Computing the first diagonal block allows us 
to determine if an element i among the first r is allowed, using Lemma 6. If so, we compute y(J+i) 
using Lemma 7 and recompute the current block so that it becomes a submatrix of Z(J + i)^ 1 
rather than Z(J)" 1 . As is evident from Lemma 7 and Eq. (9), both of these computations are 
rank-1 updates and hence require only 0(r 2 ) time. The algorithm adjoins i to J and repeats this 
process until \J\ = r. 

We now analyze the time required by this algorithm. Computing a block of r diagonal elements 
of Z(J) -1 requires only 0(r w ) time. Since there are \n/r\ such blocks, the total time for these 
computations is 0(nr w_1 ). After adding each element to J, we compute y(J + i) and recompute 
the current block in 0(r 2 ) time. Since \ J\ = r at the completion of the algorithm, the total time 
for these updates is 0(r 3 ). Thus the running time of the algorithm is 0(nr w ~ 1 + r 3 ). This bound 
is Ofar^^ 1 ) for n = r2(r 4 ~ w ). If only naive matrix multiplication is used, and the field size is 
sufficiently large, we obtain a bound of 0(nr 2 ). 
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